SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Yokoi, Haruhiko 

Shiotsu, Yukimasa 
Konishi, Noboru 

(ii) TITLE OF INVENTION: NOVEL POLYPEPTIDES 

(iii) NUMBER OF SEQUENCES: 4 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NIXON & VANDERHYE P.C. 

(B) STREET: 1100 North Glebe Rd. 8th floor 

(C) CITY: Arlington 

(D) STATE: VA 

(E) COUNTRY: USA 

(F) ZIP: 22201-4741 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/680,514 

(B) FILING DATE: 6-OCT-2000 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/765,337 

(B) FILING DATE: 23-DEC-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/ JP96/01157 

(B) FILING DATE: 26-APR-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP P . HEI . 7-102 62 5 

(B) FILING DATE: 26-APR-1995 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Crawford, Arthur R. 

(B) REGISTRATION NUMBER: 25,327 

(C) REFERENCE/ DOCKET NUMBER: 249-89 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 703-816-4000 

(B) TELEFAX: 703-816-4100 




(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu Leu 
15 10 15 

Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gin Cys Pro Glu Val 
20 25 30 

His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp Phe Ser Leu 
35 40 45 

Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys Ala Gin Asp lie Leu 
50 55 60 

Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gin 
65 70 75 80 

Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly Gin Leu Ser Gly Gin 
85 90 95 

Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu Leu Gly Thr Gin Leu 
100 105 110 

Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala lie Phe 
115 120 125 

Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu 
130 135 140 

Val Gly Gly Ser Thr Leu Cys Val Arg Arg Ala Pro Thr Tyr Arg Ala 
145 150 155 160 

Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val Arg 
165 170 175 

Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr 
180 185 190 

Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu 
195 200 205 

Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin 
210 215 220 



Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin 
225 230 235 240 



Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro Glu Leu Gly Pro Thr 
245 250 255 



Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr lie Trp 
260 265 270 

Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr Gin 
275 280 285 

Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly 
290 295 300 

Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg 
305 310 315 320 

Val Leu Arg His Leu Ala Gin Pro 
325 

INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DE, 

Ser Pro Ala Pro 
1 

Arg Asp Ser His 
20 

His Pro Leu Pro 
35 

Gly Glu Trp Lys 
50 

Gly Ala Val Thr 
65 

Leu Gly Pro Thr 



Val Arg Leu Leu 
100 

Pro Pro Gin Gly 
115 



ICRIPTION: SEQ T 

Pro Ala Cys Asp 
5 

Val Leu His Ser 



Thr Pro Val Leu 
40 

Thr Gin Met Glu 
55 

Leu Leu Leu Glu 
70 

Cys Leu Ser Ser 
85 

Leu Gly Ala Leu 



Arg Thr Thr Ala 
120 



) NO: 2: 

Leu Arg Val Leu 
10 

Arg Leu Ser Gin 
25 

Leu Pro Ala Val 



Glu Thr Lys Ala 
60 

Gly Val Met Ala 
75 

Leu Leu Gly Gin 
90 

Gin Ser Leu Leu 
105 

His Lys Asp Pro 



Ser Lys Leu Leu 
15 

Cys Pro Glu Val 
30 

Asp Phe Ser Leu 
45 

Gin Asp lie Leu 



Ala Arg Gly Gin 
80 

Leu Ser Gly Gin 
95 

Gly Thr Gin Leu 
110 

Asn Ala lie Phe 
125 



Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu 
130 135 140 



Val Gly Gly Ser Thr Leu Cys Val Arg Gly Gly Gly Ser Gly Gly Gly 
145 150 155 160 

Ser Gly Gly Gly Ser Arg Ala Pro Thr Tyr Arg Ala Ser Ser Leu Pro 
165 170 175 

Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin. Val Arg Lys lie Gin Gly 
180 185 190 

Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys 
195 200 205 

His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu Gly lie Pro Trp 
210 215 220 

Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys 
225 230 235 240 

Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin 
245 250 255 

Ala Leu Glu Gly lie Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu 
260 265 270 

Gin Leu Asp Val Ala Asp Phe Ala Thr Thr lie Trp Gin Gin Met Glu 
275 280 285 

Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro 
290 295 300 

Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala 
305 310 315 320 

Ser His Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His 
325 330 335 

Leu Ala Gin Pro 
340 

INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu Leu 
15 10 15 

Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gin Cys Pro Glu Val 
20 25 30 

His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp Phe Ser Leu 
35 40 45 

Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys Ala Gin Asp lie Leu 
50 55 60 

Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gin 
65 70 75 80 

Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly Gin Leu Ser Gly Gin 
85 90 95 

Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu Leu Gly Thr Gin Leu 
100 105 110 

Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala lie Phe 
115 120 125 

Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu 
130 135 140 

Val Gly Gly Ser Thr Leu Cys Val Arg Ser Gly Gly Gly Ser Gly Gly 
145 150 155 160 

Gly Ser Gly Gly Gly Ser Gly Gly Gly Arg Ala Pro Thr Tyr Arg Ala 
165 170 175 

Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val Arg 
180 185 190 

Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr 
195 200 205 

Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu 
210 215 220 

Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin 
225 230 235 240 

Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin 
245 250 255 

Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro Gl u Leu Gly Pro Thr 
260 265 270 

Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr lie Trp 
275 280 285 

Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr Gin 
■ 290 295 300 



Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly 
305 310 315 320 



Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg 
325 330 335 

Val Leu Arg His Leu Ala Gin Pro 
340 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 1 . . 63 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 64.. 1047 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1047 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

ATG GAG CTG ACT GAA TTG CTC CTC GTG GTC ATG CTT CTC CTA ACT GCA 4 8 

Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 

-21 -20 -15 -10 

AGG CTA ACG CTG TCC AGC CCG GCT CCT CCT GCT TGT GAC CTC CGA GTC 96 

Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val 
-5 15 10 

CTC AGT AAA CTG CTT CGT GAC TCC CAT GTC CTT CAC AGC AGA CTG AGC 144 
Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 
15 20 25 

CAG TGC CCA GAG GTT CAC CCT TTG CCT ACA CCT GTC CTG CTG CCT GCT 192 
Gin Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala 
30 35 40 

GTG GAC TTT AGC TTG GGA GAA TGG AAA ACC CAG ATG GAG GAG ACC AAG 240 
Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys 
45 50 55 



GCA CAG GAC ATT CTG GGA GCA GTG ACC CTT CTG CTG GAG GGA GTG ATG 288 

Ala Gin Asp lie Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 

60 65 70 75 

GCA GCA CGG GGA CAA CTG GGA CCC ACT TGC CTC TCA TCC CTC CTG GGG 336 

Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 

80 85 90 

CAG CTT TCT GGA CAG GTC CGT CTC CTC CTT GGG GCC CTG CAG AGC CTC 384 

Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu 

95 100 105 

CTT GGA ACC CAG CTT CCT CCA CAG GGC AGG ACC ACA GCT CAC AAG GAT 432 

Leu Gly Thr Gin Leu Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp 

110 115 120 

CCC AAT GCC ATC TTC CTG AGC TTC CAA CAC CTG CTC CGA GGA AAG GTG 4 80 

Pro Asn Ala lie Phe Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val 

125 130 135 

CGT TTC CTG ATG CTT GTA GGA GGG TCC ACC CTC TGC GTA CGG CGG GCG 528 

Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Arg Ala 

140 145 150 155 

CCA ACA TAT CGC GCC TCG AGT CTA CCA CAG AGC TTC CTT TTA AAA AGC 576 

Pro Thr Tyr Arg Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser 

160 165 170 

TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG GAG 624 

Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu 

175 180 185 

AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG 672 

Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu 

190 195 200 

CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC CCC • 720 

Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro 

205 210 215 

AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC GGC 7 68 

Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly 

220 225 230 235 

CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC CCC 816 

Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro 

240 245 250 

GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC TTT 864 

Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe 

255 260 265 

GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT GCC 912 

Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala 

270 275 280 

CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC CAG 960 



Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin 
285 290 295 



CGC CGG GCA GGA GGG GTC CTA GTT GCC TCC CAT CTG CAG AGC TTC CTG 

Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu 
300 305 310 315 

GAG GTG TCG TAC CGC GTT CTA CGC CAC CTT GCC CAG CCC 

Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 
320 325 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 
-21 -20 -15 -10 



Arg Leu Thr Leu 
-5 

Leu Ser Lys Leu 
15 

Gin Cys Pro Glu 
30 

Val Asp Phe Ser 
45 

Ala Gin Asp lie 
60 

Ala Ala Arg Gly 



Gin Leu Ser Gly 
95 

Leu Gly Thr Gin 
110 

Pro Asn Ala lie 
125 

Arg Phe Leu Met 
140 



Ser Ser Pro Ala 
1 

Leu Arg Asp Ser 



Val His Pro Leu 
35 

Leu Gly Glu Trp 
50 

Leu Gly Ala Val 
65 

Gin Leu Gly Pro 
80 

Gin Val Arg Leu 



Leu Pro Pro Gin 
115 

Phe Leu Ser Phe 
130 

Leu Val Gly Gly 
145 



Pro Pro Ala Cys 
5 

His Val Leu His 
20 

Pro Thr Pro Val 



Lys Thr Gin Met 
55 

Thr Leu Leu Leu 
70 

Thr Cys Leu Ser 
85 

Leu Leu Gly Ala 
100 

Gly Arg Thr Thr 



Gin His Leu Leu 
135 

Ser Thr Leu Cys 
150 



Asp Leu Arg Val 
10 

Ser Arg Leu Ser 
25 

Leu Leu Pro Ala 
40 

Glu Glu Thr Lys 



Glu Gly Val Met 
75 

Ser Leu Leu Gly 
90 

Leu Gin Ser Leu 
105 

Ala His Lys Asp 
120 

Arg Gly Lys Val 



Val Arg Arg Ala 
155 



Pro Thr Tyr Arg Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser 
160 165 170 



Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu 
175 180 185 

Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu 
190 195 200 

Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro 
205 210 215 

Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly 
220 225 230 235 

Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro 
240 245 250 

Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe 
255 260 265 

Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala 
270 275 280 

Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin 
285 290 295 

Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu 
300 305 310 315 

Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 
320 325 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE.: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 1 . . 63 

(ix) FEATURE: 

(A) NAME/ KEY : mat_peptide 

(B) LOCATION: 64.. 1083 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1..1083 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ATG GAG CTG ACT GAA TTG CTC CTC GTG GTC ATG CTT CTC CTA ACT GCA 
Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 
-21 -20 -15 -10 

AGG CTA ACG CTG TCC AGC CCG GCT CCT CCT GCT TGT GAC CTC CGA GTC 
Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val 
-5 15 10 



CTC AGT AAA CTG CTT CGT GAC TCC CAT GTC CTT CAC AGC AGA CTG AGC 

Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 

15 20 25 

CAG TGC CCA GAG GTT CAC CCT TTG CCT ACA CCT GTC CTG CTG CCT GCT 

Gin Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala 

30 35 40 

GTG GAC TTT AGC TTG GGA GAA TGG AAA ACC CAG ATG GAG GAG ACC AAG 

Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys 

45 50 55 

GCA CAG GAC ATT CTG GGA GCA GTG ACC CTT CTG CTG GAG GGA GTG ATG 

Ala Gin Asp lie Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 

60 65 70 75 

GCA GCA CGG GGA CAA CTG GGA CCC ACT TGC CTC TCA TCC CTC CTG GGG 

Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 

80 85 90 

CAG CTT TCT GGA CAG GTC CGT CTC CTC CTT GGG GCC CTG CAG AGC CTC 

Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu 

95 100 105 

CTT GGA ACC CAG CTT CCT CCA CAG GGC AGG ACC ACA GCT CAC AAG GAT 

Leu Gly Thr Gin Leu Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp 

110 115 120 

CCC AAT GCC ATC TTC CTG AGC TTC CAA CAC CTG CTC CGA GGA AAG GTG 

Pro Asn Ala lie Phe Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val 

125 130 135 

CGT TTC CTG ATG CTT GTA GGA GGG TCC ACC CTC TGC GTC AGG GGT GGC 

Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Gly Gly 

140 145 150 155 

GGT TCT GGA GGT GGT TCC GGA GGG GGT TCT AGA GCA CCA ACA TAT CGC 

Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Arg Ala Pro Thr Tyr Arg 

160 165 170 

GCC TCG AGT CTA CCA CAG AGC TTC CTT TTA AAA AGC TTA GAG CAA GTG 

Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val 

175 180 185 



AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG GAG AAG CTG TGT GCC 



Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala 
190 195 200 



ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG CTC GGA CAC TCT 72 0 

Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser 

205 210 215 

CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC CCC AGC CAG GCC CTG 7 68 

Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu 

220 225 230 235 

CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC GGC CTT TTC CTC TAC 816 

Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr 

240 245 250 

CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC CCC GAG TTG GGT CCC 8 64 

Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro Glu Leu Gly Pro 

255 260 265 

ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC TTT GCC ACC ACC ATC 912 

Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr lie 

270 275 280 

TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT GCC CTG CAG CCC ACC 9 60 

Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr 

285 290 295 

CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC CAG CGC CGG GCA GGA 1008 

Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly 

300 305 310 315 

GGG GTC CTA GTT GCC TCC CAT CTG CAG AGC TTC CTG GAG GTG TCG TAC 1056 

Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser Tyr 

320 325 330 

CGC GTT CTA CGC CAC CTT GCC CAG CCC 1083 

Arg Val Leu Arg His Leu Ala Gin Pro 

335 340 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 
-21 -20 -15 -10 



Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val 
-5 15 10 



Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 
15 20 25 

Gin Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala 
30 35 40 

Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys 
45 50 55 

Ala Gin Asp lie Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 
60 65 70 75 

Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 
80 85 90 

Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu 
95 100 105 

Leu Gly Thr Gin Leu Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp 
110 115 120 

Pro Asn Ala lie Phe Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val 
125 130 135 

Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Gly Gly 
140 145 150 155 

Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Arg Ala Pro Thr Tyr Arg 
160 165 170 

Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val 
175 180 185 

Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala 
190 195 200 

Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser 
205 210 215 

Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu 
220 225 230 235 

Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr 
240 245 250 

Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro Glu Leu Gly Pro 
255 260 265 

Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr lie 
270 275 280 

Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr 
285 290 295 

Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly 
300 305 310 315 



Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser Tyr 
320 325 330 



Arg Val Leu Arg His Leu Ala Gin Pro 
335 340 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: mat__peptide 

(B) LOCATION: 64.. 1095 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. . 1095 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

ATG GAG CTG ACT GAA TTG CTC CTC GTG GTC ATG CTT CTC CTA ACT GCA 4 8 

Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 
-21 -20 -15 -10 

AGG CTA ACG CTG TCC AGC CCG GCT CCT CCT GCT TGT GAC CTC CGA GTC 96 
Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val 
-5 15 10 

CTC AGT AAA CTG CTT CGT GAC TCC CAT GTC CTT CAC AGC AGA CTG AGC 144 
Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 
15 20 25 

CAG TGC CCA GAG GTT CAC CCT TTG CCT ACA CCT GTC CTG CTG CCT GCT 192 
Gin Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala 
30 35 40 

GTG GAC TTT AGC TTG GGA GAA TGG AAA ACC CAG ATG GAG GAG ACC AAG 24 0 

Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys 
45 50 55 

GCA CAG GAC ATT CTG GGA GCA GTG ACC CTT CTG CTG GAG GGA GTG ATG 2 88 

Ala Gin Asp lie Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 
60 65 70 75 

GCA GCA CGG GGA CAA CTG GGA CCC ACT TGC CTC TCA TCC CTC CTG GGG 336 
Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 
80 85 90 



CAG CTT TCT GGA CAG GTC CGT CTC CTC CTT GGG GCC CTG CAG AGC CTC 384 

Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu 
95 100 105 

CTT GGA ACC CAG CTT CCT CCA CAG GGC AGG ACC ACA GCT CAC AAG GAT 432 

Leu Gly Thr Gin Leu Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp 

110 115 120 

CCC AAT GCC ATC TTC CTG AGC TTC CAA CAC CTG CTC CGA GGA AAG GTG 480 

Pro Asn Ala lie Phe Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val 
125 130 135 

CGT TTC CTG ATG CTT GTA GGA GGG TCC ACC CTC TGC GTA CGG TCC GGA 528 

Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Ser Gly 
140 145 150 155 

GGT GGC TCT GGC GGT GGT TCT GGT GGC GGC TCC GGA GGC GGT CGT GCG 57 6 

Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Arg Ala 
160 165 170 

CCA ACA TAT CGC GCC TCG AGT CTA CCA CAG AGC TTC CTT TTA AAA AGC 624 

Pro Thr Tyr Arg Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser 
175 180 185 

TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG GAG 672 

Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu 

190 195 200 

AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG 72 0 

Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu 
205 210 215 

CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC CCC 7 68 

Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro 
220 225 230 235 

AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC GGC 816 

Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly 
240 245 250 

CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC CCC 864 

Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro 
255 260 265 

GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC TTT 912 

Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe 

270 275 280 

GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT GCC 960 

Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala 
285 290 295 

CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC CAG 1008 

Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin 
300 305 310 315 

CGC CGG GCA GGA GGG GTC CTA GTT GCC TCC CAT CTG CAG AGC TTC CTG 1056 



Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu 
320 325 330 



GAG GTG TCG TAC CGC GTT CTA CGC CAC CTT GCC CAG CCC 1095 
Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 
335 340 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 
{ D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 
-21 -20 -15 -10 

Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val 
-5 1 5 10 

Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 
15 20 25 

Gin Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala 
30 35 40 

Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys 
45 50 55 

Ala Gin Asp lie Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 
60 65 70 75 

Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 
80 85 90 

Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu 
95 100 105 

Leu Gly Thr Gin Leu Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp 
110 115 120 

Pro Asn Ala lie Phe Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val 
125 130 135 

Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Ser Gly 
140 145 150 155 

Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Arg Ala 
160 165 170' 



Pro Thr Tyr Arg Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser 
175 180 185 



Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu 
190 195 . 200 



Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu 
205 210 215 

Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro 
220 225 230 235 

Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly 
240 245 250 

Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro 
255 260 265 

Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe 
270 275 280 

Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala 
285 290 295 

Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin 
300 305 310 315 

Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu 
320 325 330 

Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 
335 340 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA H 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 27. .44 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTCTCCAAGC TTGAATTCCG GCCAGAATGG AGCTGACTGA ATTG 
(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GTAGAGGTAC CGCGGCCGCT TACCCTTCCT GAGACAGATT CTGGGAG 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TGAACCTCTG GGCACTGGCT CAGT 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GCTGCCTGCT GTGGACTTTA GCTT 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1. .24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TGT TGG AAG CTC AGG AAG ATG GCA 
Cys Trp Lys Leu Arg Lys Met Ala 
1 5 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Cys Trp Lys Leu Arg Lys Met Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CCT GAT GCT TGT AGG AGG GTC CAC 
Pro Asp Ala Cys Arg Arg Val His 
1 5 



(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Pro Asp Ala Cys Arg Arg Val His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

TCA AGA GTT CGT GTA TCC TGT TCA 24 
Ser Arg Val Arg Val Ser Cys Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Ser Arg Val Arg Val Ser Cys Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 24 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1. .24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



GAA TGG AAC TCG TGG ACT CTT TCC 24 
Glu Trp Asn Ser Trp Thr Leu Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Glu Trp Asn Ser Trp Thr Leu Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GTAAAACGAC GGCCAGT 17 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
,(A) LENGTH: 17 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
C AG GAAAC AG CTATGAC 17 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
{C} STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join (1.. 3, 43.. 66) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

TGC TCTAGAACCG CCTCCGGAAC CACCTCCAGA ACCGCCACC CCT GAC GCA GAG 54 

Cys Pro Asp Ala Glu 
1 5 



GGT GGA CCC TCC 
Gly Gly Pro Ser 



66 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Cys Pro Asp Ala Glu Gly Gly Pro Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) * LOCATION: 22. .45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GGTTCCGGAG GCGGTTCTAG A GCA CCA ACA TAT CGC GCC TCG AGT 4 5 

Ala Pro Thr Tyr Arg Ala Ser Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Ala Pro Thr Tyr Arg Ala Ser Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CATTCCGCGG GGTACCGCGG CCGCTCAGGG CTGGGCAAGG TGGCGTAG 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

GGC TGC TTG AGC CAA CTC CAT AGC 
Gly Cys Leu Ser Gin Leu His Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Gly Cys Leu Ser Gin Leu His Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1. .24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

GAC CCA ACT CGG GGG AGA TCC CTT 24 
Asp Pro Thr Arg Gly Arg Ser Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Asp Pro Thr Arg Gly Arg Ser Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: mutation 

(B) LOCATION: replace (25, ) 

(ix) FEATURE: 

(A) NAME/KEY: mutation 

(B) LOCATION: replace ( 33 .. 34 , "") 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



TAGACTCGAG GCGCGATATG TTGGCGCCCG CCGTACGCAG AGGGTGGACC CTCCTAC 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: -SEQ ID NO: 35: 

Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly 
15 10 15 

Arg 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..6 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 7 . . 57 

(D) OTHER INFORMATION: /product= "linker peptide" 

(ix) FEATURE: 

(A) NAME/ KEY: mis c_f eature 

(B) LOCATION: 1..5 

(D) OTHER INFORMATION: /product^ "SplI" 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION: 7 . . 12 

(D) OTHER INFORMATION: /product^ "Mrol" 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 



(B) LOCATION: 43.. 48 

(D) OTHER INFORMATION: /product^ "Mrol" 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 58.. 61 

(D) OTHER INFORMATION: /product= "Bbel" 

(ix) FEATURE: 

(A) NAME/KEY: mutation 

(B) LOCATION: replace ( 4 . . 5 , "") 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



GTA CGG TCCGGAGGTG GCTCTGGCGG TGGTTCTGGT GGCGGCTCCG GAGGCGGTCG 
Val Arg 
1 

TGCGC 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Val Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: mis c_f eature 

(B) LOCATION: 1..51 

(D) OTHER INFORMATION: /product= "linker peptide 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 53 



(D) OTHER INFORMATION: /product= "SplI" 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION: 10. . 15 

(D) OTHER INFORMATION: /product= "Mrol" 

(ix) FEATURE: 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION: 46.. 51 

(D) OTHER INFORMATION: /product= "Mrol" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
ACGACCGCCT CCGGAGCCGC CACCAGAACC ACCGCCAGAG CCACCTCCGG ACC 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: miscjeature 

(B) LOCATION: 1 

(D) OTHER INFORMATION: Xaa = Met 



(xi) SEQUENCE DE; 

Xaa Thr Pro Leu 
1 

Lys Cys Leu Glu 
20 

Gin Glu Lys Leu 
35 

Val Leu Leu Gly 
50 

Cys Pro Ser Gin 
65 

Ser Gly Leu Phe 



CRIPTION: SEQ T 

Gly Pro Ala Ser 
5 

Gin Val Arg Lys 



Cys Ala Thr Tyr 
40 

His Ser Leu Gly 
55 

Ala Leu Gin Leu 
70 

Leu Tyr Gin Gly 
85 



i NO: 39: 

Ser Leu Pro Gin 
10 

lie Gin Gly Asp 
25 

Lys Leu Cys His 



lie Pro Trp Ala 
60 

Ala Gly Cys Leu 
75 

Leu Leu Gin Ala 
90 



Ser Phe Leu Leu 
15 

Gly Ala Ala Leu 
30 

Pro Glu Gly Leu 
45 

Pro Leu Ser Ser 



Ser Gin Leu His 
80 

Leu Glu Gly lie 
95 



Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala 
100 105 110 



Asp Phe Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala 
115 120 125 



Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala 
130 135 140 

Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser 
145 150 155 160 

Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 
165 170 175 

INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu Leu 
15 10 15 

Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gin Cys Pro Glu Val 
20 25 30 

His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp Phe Ser Leu 
35 40 45 

Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys Ala Gin Asp lie Leu 
50 55 60 

Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gin 
65 70 75 80 



Leu Gly Pro Thr Cys 
85 

Val Arg Leu Leu Leu 
100 

Pro Pro Gin Gly Arg 
115 

Leu Ser Phe Gin His 
130 

Val Gly Gly Ser Thr 



Leu Ser Ser Leu Leu Gly 
90 

Gly Ala Leu Gin Ser Leu 
105 

Thr Thr Ala His Lys Asp 
120 

Leu Leu Arg Gly Lys Val 
135 

Leu Cys Val Arg Arg Ala 



Gin Leu Ser Gly Gin 
95 

Leu Gly Thr Gin Leu 
110 

Pro Asn Ala lie Phe 
125 

Arg Phe Leu Met Leu 
140 

Pro Pro Thr Thr Ala 



145 



150 



155 



160 



Val Pro Ser Arg Thr Ser Leu Val Leu Thr Leu Asn Glu Leu Pro Asn 
165 170 175 

Arg Thr Ser Gly Leu Leu Glu Thr Asn Phe Thr Ala Ser Ala Arg Thr 
180 185 190 

Thr Gly Ser Gly Leu Leu Lys Trp Gin Gin Gly Phe Arg Ala Lys lie 
195 200 205 

Pro Gly Leu Leu Asn Gin Thr Ser Arg Ser Leu Asp Gin lie Pro Gly 
210 215 220 

Tyr Leu Asn Arg lie His Glu Leu Leu Asn Gly Thr Arg Gly Leu Phe 
225 230 235 240 

Pro Gly Pro Ser Arg Arg Thr Leu Gly Ala Pro Asp lie Ser Ser Gly 
245 250 255 

Thr Ser Asp Thr Gly Ser Leu Pro Pro Asn Leu Gin Pro Gly Tyr Ser 
260 265 270 

Pro Ser Pro Thr His Pro Pro Thr Gly Gin Tyr Thr Leu Phe Pro Leu 
275 280 285 

Pro Pro Thr Leu Pro Thr Pro Val Val Gin Leu His Pro Leu Leu Pro 
290 295 300 

Asp Pro Ser Ala Pro Thr Pro Thr Pro Thr Ser Pro Leu Leu Asn Thr 
305 310 315 320 

Ser Tyr Thr His Ser Gin Asn Leu Ser Gin Glu Gly 
325 330 

INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Ser Gly Gly Gly Arg 
1 5 

INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 



(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) 'SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Ser Gly Gly Gly 
1 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 
15 10 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Gly Gly Gly Ser Gly Gly Gly Ser 
1 5 



INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys 
15 10 15 

Cys Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin 
20 25 30 

Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Gly Leu Val 
35 40 45 

Leu Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys 
50 55 60 

Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser 
65 70 75 80 

Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser 
85 90 95 

Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 
100 105 110 

Phe Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 
115 120 125 

Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
130 135 140 

Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 
145 150 155 160 



Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 
165 170 



